NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Bayesian Proof of the Spread Lemma

https://doi.org/10.1002/rsa.70008

Mossel, Elchanan; Niles‐Weed, Jonathan; Sun, Nike; Zadik, Ilias (July 2025, Random Structures & Algorithms)

ABSTRACT A key set‐theoretic “spread” lemma has been central to two recent celebrated results in combinatorics: the recent improvements on the sunflower conjecture by Alweiss, Lovett, Wu, and Zhang; and the proof of the fractional Kahn–Kalai conjecture by Frankston, Kahn, Narayanan, and Park. In this work, we present a new proof of the spread lemma, that—perhaps surprisingly—takes advantage of an explicit recasting of the proof in the language of Bayesian inference. We show that from this viewpoint the reasoning proceeds in a straightforward and principled probabilistic manner, leading to a truncated second moment calculation which concludes the proof.
more » « less
Free, publicly-accessible full text available July 1, 2026
Sharp thresholds in inference of planted subgraphs

https://doi.org/10.1214/24-AAP2120

Mossel, Elchanan; Niles-Weed, Jonathan; Sohn, Youngtak; Sun, Nike; Zadik, Ilias (February 2025, The Annals of Applied Probability)

Free, publicly-accessible full text available February 1, 2026
It Was “All” for “Nothing”: Sharp Phase Transitions for Noiseless Discrete Channels

https://doi.org/10.1109/TIT.2022.3225802

Niles-Weed, Jonathan; Zadik, Ilias (August 2023, IEEE Transactions on Information Theory)

Full Text Available
Lattice-Based Methods surpass sum-of-squares in clustering

Zadik, Ilias; Song, Min Jae; Wein, Alex; Bruna, Joan (July 2022, Conference on Learning theory (COLT))

Full Text Available
On the Cryptographic Hardness of Learning Single Periodic Neurons

Song, Min Jae; Zadik, Ilias; Bruna, Joan (December 2021, Advances in neural information processing systems)
null (Ed.)
Abstract We show a simple reduction which demonstrates the cryptographic hardness of learning a single periodic neuron over isotropic Gaussian distributions in the presence of noise. More precisely, our reduction shows that any polynomial-time algorithm (not necessarily gradientbased) for learning such functions under small noise implies a polynomial-time quantum algorithm for solving worst-case lattice problems, whose hardness form the foundation of lattice-based cryptography. Our core hard family of functions, which are well-approximated by one-layer neural networks, take the general form of a univariate periodic function applied to an affine projection of the data. These functions have appeared in previous seminal works which demonstrate their hardness against gradient-based (Shamir’18), and Statistical Query (SQ) algorithms (Song et al.’17). We show that if (polynomially) small noise is added to the labels, the intractability of learning these functions applies to all polynomial-time algorithms, beyond gradient-based and SQ algorithms, under the aforementioned cryptographic assumptions. Moreover, we demonstrate the necessity of noise in the hardness result by designing a polynomial-time algorithm for learning certain families of such functions under exponentially small adversarial noise. Our proposed algorithm is not a gradient-based or an SQ algorithm, but is rather based on the celebrated Lenstra-Lenstra-Lovász (LLL) lattice basis reduction algorithm. Furthermore, in the absence of noise, this algorithm can be directly applied to solve CLWE detection (Bruna et al.’21) and phase retrieval with an optimal sample complexity of d + 1 samples. In the former case, this improves upon the quadratic-in-d sample complexity required in (Bruna et al.’21).
more » « less
Full Text Available
Self-Regularity of Non-Negative Output Weightsfor Overparameterized Two-Layer Neural Networks

Gamarnik, David; Kızıldağ, Eren C.; Zadik, Ilias (January 2021, International Symposium on Information Theory)
null (Ed.)
We consider the problem of finding a two-layer neural network with sigmoid, rectified linear unit (ReLU), or binary step activation functions that "fits" a training data set as accurately as possible as quantified by the training error; and study the following question: \emph{does a low training error guarantee that the norm of the output layer (outer norm) itself is small?} We answer affirmatively this question for the case of non-negative output weights. Using a simple covering number argument, we establish that under quite mild distributional assumptions on the input/label pairs; any such network achieving a small training error on polynomially many data necessarily has a well-controlled outer norm. Notably, our results (a) have a polynomial (in d) sample complexity, (b) are independent of the number of hidden units (which can potentially be very high), (c) are oblivious to the training algorithm; and (d) require quite mild assumptions on the data (in particular the input vector X∈ℝd need not have independent coordinates). We then leverage our bounds to establish generalization guarantees for such networks through \emph{fat-shattering dimension}, a scale-sensitive measure of the complexity class that the network architectures we investigate belong to. Notably, our generalization bounds also have good sample complexity (polynomials in d with a low degree), and are in fact near-linear for some important cases of interest.
more » « less
Full Text Available
On the Cryptographic Hardness of Learning Single Periodic Neurons

Song, Min Jae; Zadik, Ilias; Bruna, Joan (December 2020, NeurIPS 2021)
null (Ed.)
Full Text Available
Free Energy Wells and Overlap Gap Property in Sparse PCA

Ben Arous, Gerard; Wein, Alexander S; Zadik, Ilias (July 2020, Proceedings of Thirty Third Conference on Learning Theory)
Abernethy, Jacob; Agarwal, Shivani (Ed.)
We study a variant of the sparse PCA (principal component analysis) problem in the “hard” regime, where the inference task is possible yet no polynomial-time algorithm is known to exist. Prior work, based on the low-degree likelihood ratio, has conjectured a precise expression for the best possible (sub-exponential) runtime throughout the hard regime. Following instead a statistical physics inspired point of view, we show bounds on the depth of free energy wells for various Gibbs measures naturally associated to the problem. These free energy wells imply hitting time lower bounds that corroborate the low-degree conjecture: we show that a class of natural MCMC (Markov chain Monte Carlo) methods (with worst-case initialization) cannot solve sparse PCA with less than the conjectured runtime. These lower bounds apply to a wide range of values for two tuning parameters: temperature and sparsity misparametrization. Finally, we prove that the Overlap Gap Property (OGP), a structural property that implies failure of certain local search algorithms, holds in a significant part of the hard regime.
more » « less
Full Text Available
The all-or-nothing phenomenon in sparse linear regression

https://doi.org/10.4171/MSL/22

Reeves, Galen; Xu, Jiaming; Zadik, Ilias (January 2020, Mathematical Statistics and Learning)

Full Text Available
All-or-Nothing Phenomena: From Single-Letter to High Dimensions

https://doi.org/10.1109/CAMSAP45676.2019.9022473

Reeves, Galen; Xu, Jiaming; Zadik, Ilias (December 2019, 2019 IEEE 8th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP))

We consider the problem of estimating a $$p$$ -dimensional vector $$\beta$$ from $$n$$ observations $$Y=X\beta+W$$ , where $$\beta_{j}\mathop{\sim}^{\mathrm{i.i.d}.}\pi$$ for a real-valued distribution $$\pi$$ with zero mean and unit variance’ $$X_{ij}\mathop{\sim}^{\mathrm{i.i.d}.}\mathcal{N}(0,1)$$ , and $$W_{i}\mathop{\sim}^{\mathrm{i.i.d}.}\mathcal{N}(0,\ \sigma^{2})$$ . In the asymptotic regime where $$n/p\rightarrow\delta$$ and $$p/\sigma^{2}\rightarrow$$ snr for two fixed constants $$\delta,\ \mathsf{snr}\in(0,\ \infty)$$ as $$p\rightarrow\infty$$ , the limiting (normalized) minimum mean-squared error (MMSE) has been characterized by a single-letter (additive Gaussian scalar) channel. In this paper, we show that if the MMSE function of the single-letter channel converges to a step function, then the limiting MMSE of estimating $$\beta$$ converges to a step function which jumps from 1 to 0 at a critical threshold. Moreover, we establish that the limiting mean-squared error of the (MSE-optimal) approximate message passing algorithm also converges to a step function with a larger threshold, providing evidence for the presence of a computational-statistical gap between the two thresholds.
more » « less
Full Text Available

« Prev Next »

Search for: All records